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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/919,854 

Input Set : A:\211739US0X.ST25.txt 
Output Set: N:\CRF4\01162003\I919854.raw 

<110> APPLICANT: FARWICK, MIKE 
HUTHMACHER, KLAUS 
BREHME, JENNIFER 
PFEFFERLE, WALTER 
BINDER, MICHAEL 
GREISSINGER, DIETER 
THIERBACH, GEORG 

<120> TITLE OF INVENTION: Nucleotide sequences which code for the sahH gene 
<130> FILE REFERENCE: 211739US0X 
<140> CURRENT APPLICATION NUMBER: 09/919,854 
<141> CURRENT FILING DATE: 2002-12-30 
<150> PRIOR APPLICATION NUMBER: DE 100 44 706.6 
<151> PRIOR FILING DATE: 2000-09-09 
<150> PRIOR APPLICATION NUMBER: DE 101 09 685.2 
<151> PRIOR FILING DATE: 2001-02-28 
<150> PRIOR APPLICATION NUMBER: US 60/294,277 
<151> PRIOR FILING DATE: 2001-05-31 
<160> NUMBER OF SEQ ID NOS : 4 
<170> SOFTWARE: Patentln version 3.1 
SEQ ID NO: 1 
LENGTH: 1939 
TYPE: DNA 

Corynebacterium glutamicum 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<223> 
<400> 



ORGANISM 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (227) . . (1720) 
OTHER INFORMATION: 
1 

tcgcggccat catgtttggc gctccatatt ctgcgggcca ggtgctggag aattacccat 
ttatccgcga actcgtcggt gagggctctg aagctcttcg cggtgctgca gcgcaggttt 
tggaagaggc agatgtggaa tatgacctcg aagcttattt agaggccctc aactagccct 
ccactaaaca gcttcaatca attcggtgtc cactccaaca tgtaga gtg gtg cgc 

Val Val Arg 
1 

gtt aaa aaa gtt ttc eta att ttc att ttc tta aaa gga get cgc cag 
Val Lys Lys Val Phe Leu lie Phe lie Phe Leu Lys Gly Ala Arg Gin 

5 10 15 

gac atg gca cag gtt atg gac ttc aag gtt gec gat ctt tea eta gca 
Asp Met Ala Gin Val Met Asp Phe Lys Val Ala Asp Leu Ser Leu Ala 
20 25 30 35 

gag gca gga cgt cac cag att cgt ctt gca gag tat gag atg cca ggt 
Glu Ala Gly Arg His Gin lie Arg Leu Ala Glu Tyr Glu Met Pro Gly 
40 45 50 
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235 
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64 etc atg cag ttg cgc aag gaa ttc gca gac gag cag cct ttg aag ggc 427 

65 Leu Met Gin Leu Arg Lys Glu Phe Ala Asp Glu Gin Pro Leu Lys Gly 

66 55 60 65 

68 gec cga att get ggt tct ate cac atg acg gtc cag acc gec gtg ctt 475 

69 Ala Arg He Ala Gly Ser He His Met Thr Val Gin Thr Ala Val Leu 

70 70 75 80 

72 att gag acc etc act get ttg ggc get gag gtt cgt tgg get tec tgc 523 

73 He Glu Thr Leu Thr Ala Leu Gly Ala Glu Val Arg Trp Ala Ser Cys 

74 85 90 95 

76 aac att ttc tec acc cag gat gag get gca gcg get ate gtt gtc ggc 571 

77 Asn He Phe Ser Thr Gin Asp Glu Ala Ala Ala Ala He Val Val Gly 

78 100 105 110 115 

80 tec ggc acc gtc gaa gag cca get ggt gtt cca gta ttc gcg tgg aag 619 

81 Ser Gly Thr Val Glu Glu Pro Ala Gly Val Pro Val Phe Ala Trp Lys 

82 120 125 130 

84 ggt gag tea ctg gag gag tac tgg tgg tgc ate aac cag ate ttc age 667 

85 Gly Glu Ser Leu Glu Glu Tyr Trp Trp Cys He Asn Gin He Phe Ser 

86 135 140 ' 145 

88 tgg ggc gat gag ctg cca aac atg ate etc gac gac ggc ggt gac gee 715 
8 9 Trp Gly Asp Glu Leu Pro Asn Met He Leu Asp Asp Gly Gly Asp Ala 
90 150 155 160 

92 acc atg get gtt att cgc ggt cgc gaa tac gag cag get ggt ctg gtt 763 

93 Thr Met Ala Val He Arg Gly Arg Glu Tyr Glu Gin Ala Gly Leu Val 

94 165 170 175 

96 cca cca gca gag gec aac gat tec gat gag tac ate gca ttc ttg ggc 811 

97 Pro Pro Ala Glu Ala Asn Asp Ser Asp Glu Tyr He Ala Phe Leu Gly 

98 180 185 190 195 

100 atg ctg cgt gag gtt ctt get gca gag cct ggc aag tgg ggc aag ate 859 

101 Met Leu Arg Glu Val Leu Ala Ala Glu Pro Gly Lys Trp Gly Lys He 
102, 200 205 210 

104 get gag gec gtt aag ggt gtc acc gag gaa acc acc acc ggt gtg cac 907 

105 Ala Glu Ala Val Lys Gly Val Thr Glu Glu Thr Thr Thr Gly Val His 

106 215 220 225 

108 cgc ctg tac cac ttc get gaa gaa ggc gtg ctg cct ttc cca gcg atg 955 

109 Arg Leu Tyr His Phe Ala Glu Glu Gly Val Leu Pro Phe Pro Ala Met 

110 230 235 240 

112 aac gtc aac gac get gtc acc aag tec aag ttt gat aac aag tac ggc 1003 

113 Asn Val Asn Asp Ala Val Thr Lys Ser Lys Phe Asp Asn Lys Tyr Gly 

114 245 250 255 

116 acc cgc cac tec ctg ate gac ggc ate aac cgc gec act gac atg etc 1051 

117 Thr Arg His Ser Leu He Asp Gly He Asn Arg Ala Thr Asp Met Leu 

118 260 265 270 275 

120 atg ggc ggc aag aac gtg ctt gtc tgc ggt tac ggc gat gtc ggc aag 1099 

121 Met Gly Gly Lys Asn Val Leu Val Cys Gly Tyr Gly Asp Val Gly Lys 

122 280 2'85 290 

124 ggc tgc get gag get ttc gac ggc cag ggc get cgc gtc aag gtc acc 1147 

125 Gly Cys Ala Glu Ala Phe Asp Gly Gin Gly Ala Arg Val Lys Val Thr 

126 295 300 305 

128 gaa get gac cca ate aac get ctt cag get ctg atg gat ggc tac tct 1195 
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129 Glu Ala Asp Pro lie Asn Ala Leu Gin Ala Leu 

130 310 315 

132 gtg gtc acc gtt gat gag gcc ate gag gac gec 

133 Val Val Thr Val Asp Glu Ala lie Glu Asp Ala 

134 325 330 

136 gcg acc ggc aac aag gac ate att tec ttc 

137 Ala Thr Gly Asn Lys Asp lie lie Ser Phe 

138 340 345 

140 aag gat cac get ctg ctg ggc aac ate 

141 Lys Asp His Ala Leu Leu Gly Asn lie 



142 



360 



144 gat atg cat tec ctg ttg cac cgc gac 

145 Asp Met His Ser Leu Leu His Arg Asp 

146 375 380 
148 aag cca cag gtc gac gag ttc acc ttc 
14 9 Lys Pro Gin Val Asp Glu Phe Thr Phe 
150 390 395 

152 gtc ctg tec gaa ggt cgc ctg ttg 

153 Val Leu Ser Glu Gly Arg Leu Leu 

154 405 410 

156 cca tea ttt gtc atg tec aac tct 

157 Pro Ser Phe Val Met Ser Asn Ser 

158 420 425 
160 ate gaa ctg ttc caa aac gaa gga 

lie Glu Leu Phe Gin Asn Glu Gly 
440 

164 ctg cct aag gtt etc gac gaa aag 

165 Leu Pro Lys Val Leu Asp Glu Lys 

166 455 

168 etc ggc ggt cag etc acc gaa ctg 

169 Leu Gly Gly Gin Leu Thr Glu Leu 

170 470 475 

172 ggc gtt gac gtt gca ggc cca ttc 

173 Gly Val Asp Val Ala Gly Pro Phe 

174 485 490 
17 6 taatgattgt cagcattgag ggaatcgacg 
178 taacgcaggt tattgatgea aaagtccttg 
180 cccaattggc cgeggaagea ctccacggcc 
182 ccatggccac gcttttcgcc ctcgaccgcc 

185 <210> SEQ ID NO: 2 

186 <211> LENGTH: 498 

187 <212> TYPE: PRT 

188 <213> ORGANISM: Corynebacterium 
190 <400> SEQUENCE: 2 

192 Val Val Arg Val Lys Lys Val Phe 

193 1 5 

196 Ala Arg Gin Asp Met Ala Gin Val 

197 20 

200 Ser Leu Ala Glu Ala Gly Arg His 



ggt 
Gly 
365 
gac 
Asp 



gag 
Glu 
350 
cac 
His 



gtc 
Val 



161 
162 



tec acc 
Ser Thr 



aac ctt ggc 
Asn Leu Gly 

ttc gcc gat 
Phe Ala Asp 
430 

cag tac gag 
Gin Tyr Glu 

445 
gtg gca cgc 
Val Ala Arg 
460 

acc aag gag 
Thr Lys Glu 

aag ccg gag 
Lys Pro Glu 

gcgccggcaa 
cattcccacg 
geatgggega 
acttcgega 



glut ami cum 

Leu lie Phe 
10 

Met Asp Phe 
25 

Gin lie Arg 



Met Asp Gly Tyr Ser 
320 

gac ate gtg ate acc 
Asp He Val He Thr 
335 

cag atg etc aag atg 
Gin Met Leu Lys Met 
355 

ttt gat aat gag ate 
Phe Asp Asn Glu He 
370 

acc cgc acc acg ate 
Thr Arg Thr Thr He 
385 

ggt cgc tec ate ate 
Gly Arg Ser He He 
400 

aac gcc acc gga cac 
Asn Ala Thr Gly His 
415 

cag acc att gcg cag 
Gin Thr He Ala Gin 
435 

aac gag gtc tac cgt 
Asn Glu Val Tyr Arg 
450 

ate cac gtt gag get 
He His Val Glu Ala 
465 

cag get gag tac ate 
Gin Ala Glu Tyr He 
480 

cac tac cgc tac 
His Tyr Arg Tyr 
495 

aaacaccctg gttteggcat 
ttatgaaacc tcgattcacg 
cctcaccgac agcgcctacg 



He Phe Leu Lys Gly 
15 

Lys Val Ala Asp Leu 
30 

Leu Ala Glu Tyr Glu 



1243 



1291 



1339 



1387 



1435 



1483 



1531 



1579 



1627 



1675 



1720 



1780 
1840 
1900 
1939 
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296 


Thr 


Gly 


His 


Pro 


Ser 


Phe 


Val 


Met 


Ser 


Asn 


Ser 


Phe 


Ala 


Asp 


Gin 


Thr 


297 








420 










425 










430 







file://C:\CRF4\Outhold\VsrI9 1 9854.htm 



1/16/03 



Page 5 of 7 



RAW SEQUENCE LISTING DATE: 01/16/2003 
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300 He Ala Gin He Glu Leu Phe Gin Asn Glu Gly Gin Tyr Glu Asn Glu 

301 435 440 445 

304 Val Tyr Arg Leu Pro Lys Val Leu Asp Glu Lys Val Ala Arg He His 

305 450 455 460 

308 Val Glu Ala Leu Gly Gly Gin Leu Thr Glu Leu Thr Lys Glu Gin Ala 

309 465 470 475 480 

312 Glu Tyr He Gly Val Asp Val Ala Gly Pro Phe Lys Pro Glu His Tyr 

313 485 490 495 
316 Arg Tyr 

320 <210> SEQ ID NO: 3 

321 <211> LENGTH: 28 

322 <212> TYPE: DNA 

323 <213> ORGANISM: ARTIFICIAL SEQUENCE 

325 <220> FEATURE: 

326 <223> OTHER INFORMATION: SYNTHETIC DNA 

328 <400> SEQUENCE: 3 

329 gtggtacctt cggtgtccac tccaacat 28 

332 <210> SEQ ID NO: 4 

333 <211> LENGTH: 28 

334 <212> TYPE: DNA 

335 <213> ORGANISM: ARTIFICIAL SEQUENCE 

337 <220> FEATURE: 

338 <223> OTHER INFORMATION: SYNTHETIC DNA 

340 <400> SEQUENCE: 4 

341 catctagata ggcgctgtcg gtgaggtc 28 
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PATENT APPLICATION: US/09/919,854 TIME: 13:51:19 

Input Set : A:\211739US0X.ST25.txt 
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L:16 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:4.1 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 1 , Line# : 39 
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